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<210> 1 
<211> 428 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<221> intron 
<222> C167) . . (236) 

<220> 

<221> intron 
<222> (323) . . (386) 

<220> 

<221> misc__f eature 
<223> Coding sequence 

<400> 1 

atgaagttct tcgccatcgc 
gaccgcagca acggcaacgg 
tgtgccaccc aagtccttgg 
aacataagaa tcctcttgac 
ccagaacgtt tacgacggca 
tctctgctgc gtggcccccg 
aacccattct gacacccaga 
gtgcttga 




of hfbl 



cgctctcttt gccgccgctg 
caatgtttgc cctcccggcc 
cctcatcggc cttgactgca 
ggaaatatgc cttctcactc 
ccgacttccg caacgtctgc 
ttgtaagttg atgccccagc 
ctgcaggccg gccaggctct 



ccgttgccca gcctctcgag 60 
tcttcagcaa cccccagtgc 120 
aagtccgtaa gttgagccat 180 
ctttacccct gaacagcctc 240 
gccaaaaccg gcgcccagcc 300 
tcaagctcca gtctttggca 3.60 
tctgtgccag accgccgtcg 420 

428 



<210> 2 
<211> 78 
<212> DNA 

<213> Artificial Sequence 




<220> 

<223> Description of Artificial Sequence: PGR 5' primer 



<400> 2 

tcgggcacta cgtgccagta tagcaacgac tactactcgc aatgccttgt tccgcgtggc 60 
tctagttctg gaaccgca 78 



<210> 3 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PGR 3* primer 
<400> 3 

tcgtacggat cctcaagcac cgacggcggt 30 



<210> 4 
<211> 63 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 4 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcagag caacggcaac 60 
ggc ■ ■ 63 



<210> 5 

<211> 2211 

<212> DNA 

<213> Trichoderma reesei 



<220> 

<221> promoter 

<222> (1) . . (2211) 

<223> cbhl promoter sequence 



<400> 5 

gaattctcac 

acctccatta 

tagattgggg 

ggtcggcaac 

tccaggaacc 

tattcgccct 

gtgtgtcttc 

tccgagctgt 

tgcatcatgt 

tggtcatcaa 

agaactggat 

tctattcaaa 

agttgtgaag 



ggtgaatgta 
cgcctccccc 
agaagttgac 
ggcaaaaaag 
tggatacatc 
aaaccgaagt 
tctaggtgca 
aactacctct 
atataatagt 
acaaagaacg 
acttgttgtg 
caccaagctt 
tcggtaatcc 



ggccttttgt 
atagagttcc 
ttccgcccag 
cacgtggctc 
catcatcacg 
gcgtggtaaa 
ttctttcctt 
gaatctctgg 
gatcctgaga 
aagacgcctc 
tcttctgtgt 
gctcttttga 
cgctgtatag 



agggtaggaa 
caatcagtga 
agctgaaggt 
accgaaaagc 
cacgaccact 
tctacacgtg 
cctctagtgt 
agaatggtgg 
aggggggttt 
ttttgcaaag 
atttttgtgg 
gctacaagaa 
taatacgagt 



ttgtcactca 
gtcatggcac 
cgcacaaccg 
aagatgtttg 
ttgatctgct 
ggcccctttc 
tgaattgttt 
actaacgact 
ggagcaatgt 
ttttgtttcg 
caacaagagg 
cctgtggggt 
cgcatctaaa 



agcaccccca 60 
tgttctcaaa 120 
catgatatag 180 
cgatctaaca 240 
ggtaaactcg 300 
ggtatactgc 360 
gtgttgggag 420 
accgtgcacc 480 
gggactttga 540 
gctacggtga 600 
ccagagacaa 660 
atatatctag 720 
tactccgaag 780 



* 



ctgctgcgaa 
catgaaaggc 
acaagcaaag 
taagtagcga 
caatgcaggg 
ggcgtttccc 
ggacgtgttt 
tgaccgactg 
aggcatgttg 
caccgatagc 
caaaccaatg 
taattgtaca 
aagcaacggc 
gatcccccaa 
gtctgactcg 
acattcaagg 
cgatacgacg 
ggcactgaac 
ggctttgggt 
tgctgccttt 
ggtttcgaat 
aacgaaatga 
gcctccctca 
catcttttga 



cccggagaat 
tatgagaaat 
cgttccgtcg 
tggaaccgga 
gtactgagct 
tgattcagcg 
tgcccttcat 
gggctgttcg 
tgaatctgtg 
agtgtctagt 
gctaaaagta 
atcaagtggc 
aaagcccact 
ttgggtcgct 
gagcgttttg 
agtatttagc 
aatactgtat 
aggcaaaaga 
gtacatgttt 
accaagcagc 
agaaagagaa 
gctagtaggc 
tgctctcccc 
ggcacagaaa 



cgagatgtgc 
tctggagacg 
cagtagcagg 
ataatataat 
tggacataac 
tacccgtaca 
ttggagaaat 
aagcccgaat 
tcgggcagga 
agcaacctgt 
cataagttaa 
taaacgtacc 
tcccacgttt 
tgtttgttcc 
catacaacca 
cagggatgct 
agtcacttct 
ttgagttgaa 
gtgctccggg 
tgagggtatg 
gcttagccaa 
aaagtcagcg 
atctactcat 
cccaatagtc 



tggaaagctt 
gcttgttgaa 
cactcattcc 
aggcaataca 
tgttccgtac 
agtcgtaatc 
aatgtcattg 
gtaggattgt 
cacgcctcga 
aaagccgcaa 
tgcctaaaga 
gtaatttgcc 
gtttcttcac 
ggtgaagtga 
agggcagtga 
tgagtgtatc 
gatgaagtgg 
actgcctaag 
caaatgcaaa 
tgataggcaa 
gaacaatagc 
aatgtgtata 
caactcagat 
aaccgcggac 



ctagcgagcg 
tcatggcgtt 
cgaaaaaact 
ttgagttgcc 
cccacctctt 
actattaacc 
cgatgtgtaa 
tatccgaact 
aggttcacgg 
tgcagcatca 
agtcatatac 
aacgcgttgt 
tcagtccaat 
aagaagacag 
tggaagacag 
gtgtaaggag 
tccatattga 
atctcgggcc 
gtgtggtagg 
atgttcaggg 
cgataaagat 
tataaaggtt 
cctccaggag 
tgcgcatcat 



gctaaattag 840 
ccattcttcg 900 
cggagattcc 960 
tcgacggttg 1020 
ctcaaccttt 1080 
cagactgacc 1140 
tttgcctgct 1200 
ctgctcgtag 1260 
caagggaaac 1320 
ctggaaaata 1380 
cagcggctaa 1440 
ggggttgcag 1500 
ctcagctggt 1560 
aggtaagaat 1620 
tgaaatgttg 1680 
gtttgtctgc 1740 
aatgtaagtc 1800 
ctcgggcttc 1860 
atcgacacac 1920 
gccactgcat 1980 
agcctcatta 2040 
cgaggtccgt 2100 
acttgtacac 2160 
g 2211 



<210> 
<211> 
<212> 
<213> 



6 

1588 
DNA 

Trichoderma 



reesei 



<220> 

<221> misc_feature 

<223> T. reesei egll cDNA 



<400> 6 

cccccctatc 

accacggcca 

cccgaggtcc 

caggacacct 

tcgtgcaccg 

aagaactgct 

agcctcacca 

cggctgtatc 

ctgagcttcg 

tctcagatgg 

agcggctact 

agccaccagg 

gccttgaccc 

ccctatggca 

accttcacca 

agcatcaccc 

gacaccatct 

gccctgagca 

aactggctcg 

atcctggcca 

gggtctacta 



ttagtccttc 
tcctggccat 
atcccaagtt 
cggtggtcct 
tcaacggcgg 
tcatcgaggg 
tgaaccagta 
tcctggactc 
acgtcgacct 
acgagaacgg 
gcgatgctca 
gcttctgctg 
ctcactcttg 
gcggctacaa 
tcatcaccca 
gcaagtacca 
cgtcctgccc 
gcggcatggt 
acagcggcaa 
acaaccccaa 
cgaactcgac 



ttgttgtccc 
tgcccggctc 
gacaacctac 
tgactggaac 
cgtcaacacc 
cgtcgactac 
catgcccagc 
tgacggtgag 
ctctgctctg 
gggcgccaac 
gtgccccgtc 
caacgagatg 
cacggccacg 
aagctactac 
gttcaacacg 
gcaaaacggc 
gtccgcctca 
gctcgtgttc 
cgccggcccc 
cacgcacgtc 
tgcgcccccg 



aaaatggcgc 
gtcgccgccc 
aagtgtacaa 
taccgctgga 
acgctctgcc 
gccgcctcgg 
agctctggcg 
tacgtgatgc 
ccgtgtggag 
cagtataaca 
cagacatgga 
gatatcctgg 
gcctgcgact 
ggccccggag 
gacaacggct 
gtcgacatcc 
gcctacggcg 
agcatttgga 
tgcagcagca 
gtcttctcca 
cccccgcctg 



cctcagttac 
agcaaccggg 
agtccggggg 
tgcacgacgc 
ctgacgaggc 
gcgtcacgac 
gctacagcag 
tgaagctcaa 
agaacggctc 
cggccggtgc 
ggaacggcac 
agggcaactc 
ctgccggttg 
ataccgttga 
cgccctcggg 
ccagcgccca 
gcctcgccac 
acgacaacag 
ccgagggcaa 
acatccgctg 
cgtccagcac 



actgccgttg 
taccagcacc 
gtgcgtggcc 
aaactacaac 
gacctgtggc 
ctcgggcagc 
cgtctctcct 
cggccaggag 
gctctacctg 
caactacggg 
cctcaacact 
gagggcgaat 
cggcttcaac 
cacctccaag 
caaccttgtg 
gcccggcggc 
catgggcaag 
ccagtacatg 
cccatccaac 
gggagacatt 
gacgttttcg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
.660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



3 



actacacgga 
cagtgcggtg 
tatagcaacg 
gacgggggca 
aagacatgct 
agcaaaaaaa 



ggagctcgac 
gcattgggta 
actactactc 
cgatagaatg 
atgttgtatc 
aaaaaaaaaa 



gacttcgagc 
cagcgggtgc 
gcaatgcctt 
cgggcacgca 
tacattagca 
aaaaaaaa 



agcccgagct 
aagacgtgca 
tagagcgttg 
gggagctcgt 
aatgacaaac 



gcacgcagac 
cgtcgggcac 
acttgcctct 
agacattggg 
aaatgaaaaa 



tcactggggg 
tacgtgccag 
ggtctgtcca 
cttaatatat 
gaacttatca 



1320 
1380 
1440 
1500 
1560 
1588 



<210> 
<211> 
<212> 
<213> 



7 

745 
DNA 

Trichoderma 



reesei 



<220> 
<221> 
<222> 
<223> 



terminator 
(1) . . (745) 
T. reesei cbhl 



terminator 



<400> 7 

ggacctaccc 

tgcgccagcg 

tccgtgcgaa 

ggagctacat 

atacggtcaa 

ttggcaaatt 

taacggaata 

cccgtagaat 

caatgttgat 

cgaacgcggt 

gcaacagtgg 

tggaaagcac 

acgatgacaa 



agtctcacta 
gcacaacttg 
agcctgacgc 
ggccccgggt 
ctcatctttc 
gtggctttcg 
gaagaaagag 
cgccgctctt 
attgttccgc 
agtggctgct 
aaattagtgg 
tgttggagac 
cgtagccgag 



cggccagtgc 
ccaggtcctg 
accggtagat 
gatttatttt 
actggagatg 
aaaacacaaa 
gaaattaaaa 
cgtgtatccc 
cagtatggct 
gccaattggt 
cgcaataatt 
caacttgtcc 
gaccc 



ggcggtattg 
aacccttact 
tcttggtgag 
ttttgtatct 
cggcctgctt 
acgattcctt 
aaaaaaaaaa 
agtaccacgt 
ccacccccat 
aatgaccata 
gagaacacag 
gttgcgaggc 



gctacagcgg 
actctcagtg 
cccgtatcat 
acttctgacc 
ggtattgcga 
agtagccatg 
aacaaacatc 
caaaggtatt 
ctccgcgaat 
gggagacaaa 
tgagaccata 
caacttgcat 



ccccacggtc 60 
cctgtaaagc 120 
gacggcggcg 180 
cttttcaaat 240 
tgttgtcagc 300 
cattttaaga 360 
ccgttcataa 420 
catgatcgtt 480 
ctcctcttct 540 
cagcataata 600 
gctggcggcc 660 
tgctgtcaag 720 

745 



<210> 8 
<211> 10 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 
<400> 8 

taaccgcggt 10 



<210> 9 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: annealed primer 



<400> 9 

ctagaccgcg gttaat 



16 




<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 



10 

1232 
DNA 

Trichoderma 



reesei 



promoter 
(1) . . (1232) 
T. reesei gpdl 



promotor 



<400> 10 

gtcgacacga 

atgtgacaat 

caccaaggtg 

gttagtgaag 

cagtacgtca 

gattgcctca 

agattgagag 

ccccagccgg 

tgcttggcaa 

gcagttgcta 

taggatggcg 

aagttgatga 

gatgccgcat 

ccactatctc 

gtgttaatct 

cttcctcctc 

agcctcgcaa 

gcccgacgat 

gccccgccag 

tccctctcgt 

cacgcagctc 



tatacaggcg 
attgactgcg 
catgcatcat 
gcaccgtgat 
agtaacgtca 
atcacatagt 
ctaccttaca 
cgcgattgtg 
tcgtgctcag 
aggtacctag 
gctggcgggg 
taccgttttg 
caccgttgtg 
aggtaaccag 
ttccacctcc 
ttcttctcca 
atcctcagtt 
gcggcttctg 
gactctcctg 
ttcttcctaa 
aattcgcaga 



cggctgatga 
agga'accatc 
aaggattctc 
ggcaagatag 
aagccgccct 
acctacctat 
tcaatatggc 
tgtactaggt 
ctgttaggtt 
ggagggattc 
ccgaagctgg 
ccagattaga 
tcagaaacga 
gtactaccag 
ctcctcaatc 
taaccattcc 
tgtatgtacg 
ttcgcctgcc 
cgtcaccaat 
caaacaacca 
tacaaatcta 



taatgatgat 
aggtgtgtat 
ctcagctcac 
aaccactatt 
cccgtaacct 
gcattatggg 
cagcacctct 
aggctcgtac 
gtacttgtat 
aacgagccct 
gaactcgcca 
tgcgagaagc 
ccaagctaag 
ctaccctacc 
ttcttttccc 
taacaacatc 
tacgtactac 
cctcctctca 
ttttttccct 
ccaccaaaat 
ga 



cgagcatgac 
ggatggaatc 
caacaacgaa 
gcatctgcgc 
cgcccgttgt 
cggcctcaac 
tcggcgatac 
tataccagca 
ggtacttgta 
gcttccaatg 
acagtcatat 
agcatgaatg 
caactaaggt 
tgccgtgcct 
tcctctcctc 
gacattctct 
aatcatcacc 
ctcgtgccct 
atttacccct 
ctctttggaa 



ttgatgctgt 60 
attctgtaac 120 
cgatggccat 180 
ttcccacgca 240 
tgctcccccc 300 
ccaccccccc 360 
atactcgcca 420 
ggagaggtgc 4 80 
aggtggtcat 540 
tccatctgga 600 
gtaatagctc 660 
tcgctcatcc 720 
accttaccgt 780 
acctgcttta 840 
tttttttttt 900 
cctaatcacc 960 
acgatcgtcc 1020 
tgacgagcta 1080 
cctccctctc 1140 
gctcacgact 1200 

1232 



<210> 
<211> 
<212> 
<213> 



11 

1129 
DNA 

Trichoderma 



reesei 



<220> 
<221> 
<222> 
<223> 



terminator 

(1) . . (1129) 

T. reesei gpdl 



terminator 



<400> 11 

ggatcccgag 

atgacgaata 

ccttgactgc 

gcgtgcgagc 

cttggcttac 

atcgcttcta 

aggtcgattc 

agaaggaatg 

ccaaaagcac 

agaatgatct 

aacaaaaaaa 



cattgtctat 
gagccaatct 
ccattcaaac 
ccgtttgtac 
gatcatgttt 
tgattaggta 
caatcatgat 
gtaggacctc 
ctacctatct 
caagaggccg 
gaaagatgag 



gaatgcaaac 
gctccacttg 
aattgtaaag 
gttattttga 
gcgatcttcg 
ctcagacaac 
ctggactggg 
tcaggcgtcc 
acctagagta 
tgaacgcaga 
gactagaaga 



aaaaatagta 
actatcttgt 
gaatatagct 
gaaagcgttg 
gtaagaatac 
acatgggaaa 
gtattccatc 
accatctgtg 
actgcacgag 
aacacactcc 
tggagtattt 



aataaatagt 
gactgtatcg 
acaagttatg 
ccatcacatg 
acagagtaac 
caagataacc 
taagccatag 
ctgcaaatcc 
aaaagaaaag 
tcccaacttt 
ccttcttaga 



aattctggcc 60 
tatgtcgaac 120 
tctcacgttt 180 
ctcacagtca 240 
gattatctcc 300 
atcgcatgca 360 
taccctcgag 420 
aagaaacccc 480 
gagcagaaga 54 0 
tcaagttttg 600 
gagctctcgg 660 



5 



tgaggtgacc 
ccgcgcctcg 
gaagcggggt 
atcaaacagc 
accgaggaac 
gttcctcaat 
aggagagaga 
gaatcagaag 



tgtcagggtt 
cctcttctct 
tatcgacgcc 
ccccgagtga 
aataaagcga 
caatcccaac 
tatttccact 
tgacaacaac 



taccgcaaac 
cctgtccttt 
accaaagatc 
tcagatccgt 
gtaaataaca 
ccccctaaaa 
accgctcacc 
gggttgagca 



cgtcggtggt 
catagaatcc 
ttgtcttggt 
aaaagaagaa 
tcaaaataag 
gcccttcccc 
accaagtgag 
catgggatat 



■tctatccaat 
cgtctccttg 
gacttatcaa 
gaagagtacg 
agtctcgttg 
ccatggtata 
gcttgccgag 
cggcgcgcc 



taatcaagtc 720 
ttgcttgatc 780 
tcctttggtg 840 
atttaaccag 900 
aaaattactt 960 
tcccggcagt 1020 
agaagaggat 1080 

1129 



<210> 12 
<211> 5733 
<212> DNA 

<213> Aspergillus nidulans 



<220> 

<221> misc_f eature 

<223> (1-5733) Sequence of plasmid pAN52-l 



<220> 

<221> promoter 

<222> (1) . . (2129) 

<223> A. nidulans gpdA promoter 

<220> 

<221> gene 

<222> (2130) . , (2304) 

<223> A. nidulans gpdA gene 

<220> 

<221> terminator 

<222> (2305) . . (3071) 

<223> A. nidulans trpC terminator 



<220> 

<221> misc_feature 

<222> (3072) . . (5726) 

<223> pUClS from Sail to EcoRI 



<400> 12 

caattccctt 

cgtttatatc 

cttgaagtaa 

agcggcgagg 

cgccagctca 

ttcaagtaca 

tggttaagat 

ctgccaacag 

ccgagaagaa 

cggagcggtg 

ggaggtagca 

gagggtcctt 

ccttttattc 

ggttttcgaa 

acgtggcact 

gttgctagca 

catgatctct 



gtatctctac 
ttgcatcgtc 
tctctgcaga 
agcctcgtcc 
ttaagagcca 
ccaacgctct 
ccacgtcttc 
ctttctcagc 
ctggaggggt 
tcaagaggat 
tagcttcatt 
tggctttcaa 
accaatgagc 
atacatccgg 
attgatcatc 
gttccaggtg 
agttagctca 



acacaggctc 
ccaaagctat 
tctttcgaca 
tgtcacaact 
gttcatgggc 
tctgatccag 
gggaagccag 
cagggccagc 
ggtgtcaagg 
attcttcgac 
tggatttgct 
tattctcaag 
attggaatga 
atgtcgaagg 
cgatagctct 
gaatgttatg 
ccacaaaagt 



aaatcaataa 
tggcgggata 
ctgaaatacg 
accaacatgg 
gttggcatga 
tcgatcatcc 
cgactggtga 
ccaagaccga 
aggagtaagc 
tctgtattat 
ttccaggctg 
tatctcgagt 
acatgaatct 
cttggggcac 
gcaaagggcg 
atgagcattg 
cagacggcgt 



gaagaacggt 
ttctgtttgc 
tcgagcctgc 
agtacgataa 
tggccgtcat 
gctgaaggcg 
cctccagcgt 
caaggcctcc 
tccttattga 
agataagatg 
agactctagc 
ttgaacttat 
gaggactgca 
ctgcgttggt 
ttgcacaatg 
tattaaatca 
aaccaaaagt 



tcgtcttttt 60 
agttggctga 120 
tccgcttgga 180 
gggccagttc 240 
gcatctgtac 300 
ctttcgaatc 360 
ccctttaagg 420 
ctccagaacg 480 
agtcggagga 540 
atgaggaatt 600 
ttggagcata 660 
tccctgtgaa 720 
atcgccatga 780 
tgaatttaga 840 
caagtcaaac 900 
ggagatatag 960 
cacacaacac 1020 



6 




aagctgtaag 
gtaccattta 
aaagtcgtat 
ataaacttta 
aaatctgtcc 
gttgacctag 
cctccaccat 
ctttctggca 
gccagacagc 
ccctccgccc 
atcggagaat 
ggggtgtata 
ttccgatctg 
ccggcgcgta 
ctctcctgct 
cagaccggga 
gtccctcagt 
tgacaaggtc 
ctgctctttt 
acctttattt 
attcctttga 
cccgcttgag 
gacgaatctg 
gttcaggatc 
tttaatagct 
agctcatctg 
cgaagagaag 
caacggtcgt 
tatttgtctc 
aaaattccgt 
tcaagcctac 
atggtataca 
ccggccgaaa 
ttgtttagag 
catctccact 
gactgggaaa 
agctggcgta 
aatggcgaat 
cgcatatggt 
cacccgccaa 
agacaagctg 
aaacgcgcga 
ataatggttt 
tgtttatttt 
atgcttcaat 
attccctttt 
gtaaaagatg 
agcggtaaga 
aaagttctgc 
cgccgcatac 
cttacggatg 
actgcggcca 
cacaacatgg 
ataccaaacg 
ctattaactg 
gcggataaag 
gataaatctg 



gatttcggca 

attctatttg 

agctaccagt 

agcctaaact 

agatcatggt 

ctgattctgg 

ttgtagaaaa 

tgcggagaga 

tctggcggct 

cgaagtggaa 

atggagcttc 

gccgtcggcg 

gtaaaagatt 

agctccctaa 

ttgcccggtg 

acacaagctg 

ccctggtagg 

gttgcgtcag 

cttttctctt 

cccctaagta 

acctttcagt 

cagacatcac 

gatataagat 

tcgataagat 

ccatgtcaac 

caatgcatta 

aatagcttag 

caagagacct 

taattgtact 

caccagcncc 

aggacacaca 

atagtaacca 

cttttttaca 

gtaatccttc 

cgacctgcag 

accctggcgt 

atagcgaaga 

ggcgcctgat 

gcactctcag 

cacccgctga 

tgaccgtctc 

gacgaaaggg 

cttagacgtc 

tctaaataca 

aatattgaaa 

ttgcggcatt 

ctgaagatca 

tccttgagag 

tatgtggcgc 

actattctca 

gcatgacagt 

acttacttct 

gggatcatgt 

acgagcgtga 

gcgaactact 

ttgcaggacc 

gagccggtga 



cggctacgga 
tgtttgatcg 
gaggaagtgg 
atacagaata 
tgaccggtgc 
agtgacccag 
atgtgacgaa 
cggacggacg 
ctgaggtgca 
aggctggtgt 
atcgaatcac 
aaatagcatg 
cacgagatag 
ttggcccatc 
tatgaaaccg 
gcagtcgacc 
cagctttgcc 
tccaacattt 
tcttttccca 
agtactttgc 
tcgagctttc 
catggatcca 
cgttggtgtc 
acgttcattt 
aagaataaaa 
atgcattgac 
cagagctatt 
acgagactga 
ttgacatgct 
tgggttcgca 
ttcatcgtag 
tgcatggttg 
actctcctat 
tttctagaag 
gcatgcaagc 
tacccaactt 
ggcccgcacc 
gcggtatttt 
tacaatctgc 
cgcgccctga 
cgggagctgc 
cctcgtgata 
aggtggcact 
ttcaaatatg 
aaggaagagt 
ttgccttcct 
gttgggtgca 
ttttcgcccc 
ggtattatcc 
gaatgacttg 
aagagaatta 
gacaacgatc 
aactcgcctt 
caccacgatg 
tactctagct 
acttctgcgc 
gcgtgggtct 



agacggagaa 
agacctaata 
actcaaatcg 
agataggtgg 
ctggatcttc 
agggtcatga 
ctcgtgagct 
cagagagaag 
gtggatgatt 
gcccctcgtt 
cggcagtaag 
ccattaacct 
taccttctcc 
cggcatctgt 
gaaaggccgc 
catccggtgc 
ccgtctgtcc 
gttgccatat 
tcttcagtat 
tacatccata 
ccacttcatc 
cttaacgtta 
gatgtcagct 
gtccaagcag 
cgcgttttcg 
tgcaacctag 
ttcattttcg 
ggaatccgct 
cctcttcttt 
aagataattg 
gtataaacct 
cctagtgaat 
gagtcgttta 
tcctcgtgta 
ttggeactgg 
aatcgccttg 
gatcgccctt 
ctccttacgc 
tctgatgccg 
cgggcttgtc 
atgtgtcaga 
cgcctatttt 
tttcggggaa 
tatccgctca 
atgagtattc 
gtttttgctc 
cgagtgggtt 
gaagaacgtt 
cgtattgacg 
gttgagtact 
tgcagtgctg 
ggaggaccga 
gatcgttggg 
cctgtagcaa 
tcccggcaac 
tcggcccttc 
cgcggtatca 



gccaccttca 
cagcccctac 
acttcagcaa 
agagcttata 
ctatagaatc 
cttgagccta 
ctgtacagtg 
ggctgagtaa 
attaatccgg 
gaccaagaat 
cgaaggagaa 
aggtacagaa 
gaagtaggta 
agggcgtcca 
tcaggagctg 
tctgcactcg 
gcccggtgtg 
tttcctgctc 
attcatcttc 
ctccatcctt 
gcagcttgac 
ctgaaatcat 
ccggagttga 
caaagagtgc 
ggtttacctc 
taacgccttn 
ggagacgaga 
cttggctcca 
actctgatag 
catgtttctt 
cgaaatcant 
gctccgtaac 
cccagaatgc 
ctgtgtaagc 
ccgtcgtttt 
cagcacatcc 
cccaacagtt 
atctgtgcgg 
catagttaag 
tgctcccggc 
ggttttcacc 
tataggttaa 
atgtgcgcgg 
tgagacaata 
aacatttccg 
acccagaaac 
acatcgaact 
ttccaatgat 
ccgggcaaga 
caccagtcac 
ccataaccat 
aggagctaac 
aaccggagct 
tggcaacaac 
aattaataga 
cggctggctg 
ttgcagcact 



gtggactcga 
aacgaccatc 
catctcctgg 
ccgagctccc 
atccttattc 
aaatccgccg 
accggtgact 
taagccactg 
gaccggccgc 
ctattgcatc 
tgtgaagcca 
gtccaattgc 
gagcgagtac 
aatatcgtgc 
gccagcggcg 
acctgctgag 
tcggcggggt 
tccccaccag 
ccatccaaga 
cccatccctt 
taacagctac 
caaacagctt 
gacaaatggt 
cttctagtga 
ttccagatac 
caggctccgg 
tcaagcagat 
cgcgactata 
cttgactatg 
ccttgaactc 
tcctactaag 
acccaatacg 
acaggtacac 
gcccactcca 
acaacgtcgt 
ccctttcgcc 
gcgcagcctg 
tatttcacac 
ccagccccga 
atccgcttac 
gtcatcaccg 
tgtcatgata 
aacccctatt 
accctgataa 
tgtcgccctt 
gctggtgaaa 
ggatctcaac 
gagcactttt 
gcaactcggt 
agaaaagcat 
gagtgataac 
cgcttttttg 
gaatgaagcc 
gttgcgcaaa 
ctggatggag 
gtttattgct 
ggggccagat 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
'2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
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ggtaagccct 
cgaaatagac 
caagtttact 
taggtgaaga 
cactgagcgt 
cgcgtaatct 
gatcaagagc 
aatactgtcc 
cctacatacc 
tgtcttaccg 
acggggggtt 
ctacagcgtg 
ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
gataaccgta 
cgcagcgagt 
gcgcgttggc 
agtgagcgca 
tttatgcttc 
aacagctatg 



cccgtatcgt 
agatcgctga 
catatatact 
tcctttttga 
cagaccccgt 
gctgcttgca 
taccaactct 
ttctagtgta 
tcgctctgct 
ggttggactc 
cgtgcacaca 
agctatgaga 
gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 
ttaccgcctt 
cagtgagcga 
cgattcatta 
acgcaattaa 
cggctcgtat 
accatgatta 



agttatctac 
gataggtgcc 
ttagattgat 
taatctcatg 
agaaaagatc 
aacaaaaaaa 
ttttccgaag 
gccgtagtta 
aatcctgtta 
aagacgatag 
gcccagcttg 
aagcgccacg 
aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tgagtgagct 
ggaagcggaa. 
atgcagctgg 
tgtgagttag 
gttgtgtgga 
cgaattgcgg 



acgacgggga 
tcactgatta 
ttaaaacttc 
accaaaatcc 
aaaggatctt 
ccaccgctac 
gtaactggct 
ggccaccact 
ccagtggctg 
ttaccggata 
gagcgaacga 
cttcccgaag 
cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
gataccgctc 
gagcgcccaa 
cacgacaggt 
ctcactcatt 
attgtgagcg 
ccg 



gtcaggcaac 
agcattggta 
atttttaatt 
cttaacgtga 
cttgagatcc 
cagcggtggt 
tcagcagagc 
tcaagaactc 
ctgccagtgg 
aggcgcagcg 
cctacaccga 
ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
gccgcagccg 
tacgcaaacc 
ttcccgactg 
aggcacccca 
gataacaatt 



tatggatgaa 4500 
actgtcagac 4560 
taaaaggatc 4620 
gttttcgttc 4680 
tttttttctg 4740 
ttgtttgccg 4800 
gcagatacca 4860 
tgtagcaccg 4920 
cgataagtcg 4980 
gtcgggctga 5040 
actgagatac 5100 
ggacaggtat 5160 
gggaaacgcc 5220 
atttttgtga 5280 
tttacggttc 5340 
tgattctgtg 5400 
aacgaccgag 54 60 
gcctctcccc 5520 
gaaagcgggc 5580 
ggctttacac 5640 
tcacacagga 5700 

5733 



<210> 13 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 13 

gtcaaccgcg gactgcgcat catgaagttc ttcgccatc 39 



<210> 14 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5* primer 
<400> 14 

tctagcaagc ttggctctag ttctggaacc gcaccaggcg gcagcaacgg caacggcaat 60 
gt-ttgc 66 



<210> 15 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3* primer 
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<400> 15 

tcgtacaagc tttcaagcac cgacggcggt 



30 



<210> 
<211> 
<212> 
<213> 



16 
33 
DNA 

Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: PGR 5* primer 
<400> 16 

tctagctcta gaagcaacgg caacggcaat gtt 



33 



<210> 
<211> 
<212> 
<213> 



17 
37 
DNA 

Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 17 

tgctagtcga cctgctagca gcaccgacgg cggtctg 



37 



<210> 
<211> 
<212> 
<213> 



18 

4614 
DNA 

Saccharomyces 



cerevisiae 



<220> 

<223> S. cerevisiae FLOl coding sequence 



<400> 18 

atgacaatgc 

gtggcctcag 

ataaattttt 

gcttatggat 

attgattata 

tcctatggaa 

tactggagta 

acaggttatt 

gactctgcaa 

ccgccgatca 

ccacctaata 

gtttactcga 

accactgtaa 

tctaactgta 

gaaccatgga 

aacggcgttc 

atcataacta 

acagtcactg 

acaacagcca 

tctaccgaat 



ctcatcgcta 
gagccacaga 
accagtattc 
atgcctcaaa 
atattccctg 
actggggatg 
ctgatttatt 
ttttaccacc 
ttctatcagt 
catcaacgaa 
tcgaaggaac 
acgctgtttc 
gtgatgactt 
ctgtccctga 
ccggtacttt 
caactgacga 
caactgagcc 
gcaccaatgg 
ctactgccat 
tgaccacagt 



tatgtttttg 
ggcgtgctta 
attgaaagat 
aaccaaacta 
tgttagttca 
caaaggaatg 
tggtttctat 
acagacgggt 
aggtggtgca 
ctttaccatt 
cgtctatatg 
ttggggtaca 
cgaagggtac 
cccttcaaat 
cacttctaca 
aaccgtcatt 
atggaacagc 
tgtacgaact 
aactacaact 
caccggtacc 



gcagtcttta 
ccagcaggcc 
tcctccacat 
ggttctgtcg 
tcaggcacat 
ggtgcttgtt 
actaccccaa 
tcttacacat 
accgcgttca 
gacggtatca 
tacgctggct 
cttccaatta 
gtctattcct 
tatgctgtca 
tctactgaaa 
gtcatcagaa 
acttttacct 
gacgaaacca 
gagccatgga 
aatggtttgc 



cacttctggc 
agaggaaaag 
attcgaatgc 
gaggacaaac 
ttccttgtcc 
ctaatagtca 
caaacgtaac 
tcaagtttgc 
actgttgtgc 
agccatgggg 
actattatcc 
gtgtgacact 
ttgacgatga 
gtaccactac 
tgaccaccgt 
ctccaacaac 
ctacttctac 
tcattgtaat 
acagcacttt 
caactgatga 



actaactagt 60 
tgggatgaat 120 
agcatatatg 180 
tgatatctcg 240 
tcaagaagat 300 
aggaattgca 360 
cctagaaatg 420 
tacagttgac 480 
tcaacagcaa 540 
tggaagtttg 600 
aatgaaggtt 660 
tccagatggt 720 
cctaagtcaa 780 
aactacaacg 840 
caccggtacc 900 
tgctagcacc 960 
cgaattgacc 1020 
cagaacacca 1080 
tacctctact 1140 
gaccatcatt 1200 
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gtcatcagaa 
acttttacct 
gatgaaacca 
cagccatgga 
aacggtttgc 
gccatgacta 
accgtcaccg 
acaacagcca 
tccactgaaa 
gtcatcagaa 
acttttacct 
gatgagacca 
cagccatgga 
aacggtttgc 
gccatgacta 
accgtcaccg 
acaacagcca 
tccactgaaa 
gtcatcagaa 
actttcacct 
gacgaaaccg 
gaaccatgga 
aacggcgttc 
atcagcacca 
accattactg 
actagtgaag 
tctactgaaa 
gtcatcagaa 
actttcactt 
gacgaaactg 
gaaccatgga 
aacggtcaac 
gttacaacca 
actgtcactg 
actactgcca 
acgtcttcgc 
tcctcagtaa 
tcctcagtca 
tcatccgtca 
gctggttctg 
tcttcttcat 
ttaccacctg 
tgcgagtctc 
actgttagcg 
acaaagcaaa 
gttacaattt 
tctacaagca 
tccaccacag 
gtgtgttccg 
gttgttacgg 
aaaatgaaca 
aatactgtag 
acctcttcgc 
attggtcaca 
agttccgggt 
ggatatagta 
gccggtagtg 



caccaacaac 
ctacatccac 
tcattgtcat 
acgacacttt 
caactgatga 
caactcagcc 
gtaccaatgg 
ctactgccat 
tgaccaccgt 
caccaacaac 
ctacttctac 
tcattgtcat 
acgacacttt 
caactgatga 
caactcagcc 
gtaccaacgg 
ctactgccat 
tgaccaccgt 
ctccaactag 
ctacatccac 
tgattgttat 
ctggtacttt 
caactgacga 
ccactgaacc 
gaaccaatgg 
gtctaatcag 
tgaccaccgt 
ctccaaccag 
cgacttccac 
tgattgttat 
ctggtacttt 
caactgacga 
ccactgaacc 
gaaccaatgg 
tctcatccag 
gtccaattat 
tttcttcctc 
tttcttcttc 
ttccaaccag 
tctcttcttc 
cattaccact 
ctaccactac 
atgtgtgcac 
gcgtcacaac 
ccaaagggac 
cttcttgtga 
ctgctactat 
aatcgaggca 
aaactgcttc 
tctatcctac 
gtgctaccgg 
ctgctgagac 
tttcaagatc 
gcagtagtgt 
tgagtactat 
cagcttcttt 
gtttaagtgt 



agccactact 
tgaaatgacc 
cagaacacca 
tacctctaca 
aaccatcatt 
atggaacgac 
tttgccaact 
gactacaact 
caccggtacc 
agccactact 
cgaattgacc 
cagaacacca 
tacctctaca 
aaccatcatt 
atggaacgac 
tttgccaact 
gactacaact 
caccggtacc 
tgaaggtcta 
tgagatgacc 
cagaactcca 
tacttctaca 
aaccgtcatt 
atggactggt 
tcaaccaact 
caccaccact 
caccggtacc 
tgaaggtcta 
tgaggttacc 
cagaactcca 
cacttctaca 
aaccgtgatt 
atggactggt 
cttgccaact 
tttgtcatca 
taccccattc 
agtcacttct 
tacaacaacc 
tagttccacc 
ctcttttatc 
tgttaccagt 
aaaaacgagc 
tgaatccatc 
agagtatacc 
aacagagcaa 
atctgacgta 
taacggcgtt 
acaaacaacg 
acctgccatt 
atggaggcca 
tgagacaaca 
gattaccaat 
taatcacgct 
tgtttctgta 
gtcgcaacag 
agaaatttca 
cttcattgcg 



gccatgacta 
accgtcaccg 
acaacagcca 
tccactgaaa 
gtcatcagaa 
acttttacct 
gatgagacca 
cagccatgga 
aacggtttgc 
gccataacta 
acagtcaccg 
acaacagcca 
tccactgaaa 
gtcatcagaa 
acttttacct 
gatgagacca 
cagccatgga 
aacggcgttc 
atcagcacca 
accgtcaccg 
accagtgaag 
tctactgaaa 
gtcatcagaa 
acttttactt 
gacgaaaccg 
gaaccatgga 
aacggcgttc 
atcagcacca 
accatcactg 
accagtgaag 
tctgctgaaa 
gttatcagaa 
acttttactt 
gatgaaactg 
tcatcttcag 
tatcctagca 
tctctattca 
tccacttcta 
tctggttctt 
tcttctgaat 
gcgacaacaa 
gaacaaacca 
tcccctgcga 
acatggtgcc 
accacagaaa 
tgctctaaga 
actacagaat 
ctagttactg 
gtttcgacgg 
cagactgcga 
accaatactt 
actggagctg 
gaaacacaga 
tccgaaactg 
cctcgtagca 
acgtatgctg 
tccttattgc 



caactcagcc 
gtaccaacgg 
ctactgctat 
tgaccaccgt 
caccaacaac 
ctacatccac 
tcattgtcat 
acgacacttt 
caactgatga 
caactgagcc 
gtaccaatgg 
ctactgccat 
tgaccaccgt 
caccaacaac 
ctacatccac 
tcattgtcat 
acgacacttt 
caactgacga 
ccactgaacc 
gtactaacgg 
gtttggttac 
tgaccaccat 
ctccaaccag 
ctacatctac 
ttattgttat 
ctggtacttt 
caactgacga 
ccactgaacc 
gaaccaacgg 
gtctaatcag 
tgaccaccgt 
ctccaaccag 
cgacttccac 
tcattgttgt 
gacaaatcac 
atggaacttc 
cttcttctcc 
tattttctga 
ctgagagcga 
catcaaaatc 
gccaggaaac 
ctttggttac 
ttgtttccac 
ctatttctac 
caacaaaaca 
ctgcttctcc 
acacaacatg 
ttacttcctg 
ccacggctac 
atgaagagtc 
tagctgctga 
ctgagacgaa 
cggcttccgc 
gcaacaccaa 
caccagcaag 
gcagtgccaa 
tggcaattat 



atggaacgac 
tttgccaact 
gactacaact 
caccggtacc 
agccactact 
tgaaatgacc 
cagaacacca 
tacctctaca 
aaccatcatt 
atggaacagc 
tttgccaact 
gactacaact 
caccggtacc 
agccactact 
tgaaatgacc 
cagaacacca 
tacctctaca 
aaccgtcatt 
atggactggt 
tcaaccaact 
aaccaccact 
tactggaacc 
tgaaggtcta 
tgaaatgacc 
cagaactcca 
cacttctaca 
aaccgtcatt 
atggactggc 
tcaaccaact 
caccaccact 
caccggtact 
tgaaggtttg 
tgaaatgtct 
caaaactcca 
cagctctatc 
tgtgatttct 
agtcatttct 
atcatctaaa 
aacgagttca 
tcctacatat 
tgcttcttca 
cgtgacatcc 
agctactgtt 
tacagagaca 
aaccacggta 
agccattgta 
gtgtcctatt 
cgaatctggt 
tgtgaatgat 
tgtcagctct 
aacgactacc 
aacagtagtc 
gaccgatgtg 
gagtctaaca 
cagcatggta 
cagcttactg 
ttaa 



1260 
1320 
1380 
1440 
•1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4614 
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<210> 19 
<211> 5857 
<212> DNA 

<213> Saccharomyces cerevisiae and E. coli 
<220> 

<221> promoter 
<222> (1) . . (452) 

<223> S. cerevisiae GALl promoter 
<220> 

* 

<221> misc_feature 

<C223> (476-495) E. coli T7 promoter /priming site 
<220> 

<221> misc_feature 

<223> (502-601) E. coli multiple cloning site 
<220> 

<221> misc_feature 

<223> (609-857) S. cerevisiae CYCl transcription terminator 
<220> 

<221> misc_feature 

<223> (1039-1712) E. coli pMBl (pUC-derived) origin 
<220> 

<221> gene 

<222> (1857) . . (2717) 

<223> E. coli ampicillin resistance gene 

<220> 

<221> gene 

<222> (2735) . . (3842) 

<223> S. cerevisiae URA3 gene 

<220> 

<221> misc_feature 

<223> (3846-5317) S. cerevisiae 2 micron origin 
<220> 

<221> misc_feature 

<223> (5385-5840) E. coli fl origin 
<220> 

<221> misc_feature 

<223> (1-5857) Sequence of pYES2 



<400> 19 

acggattaga 

cctcgtcctc 

acaataaaga 

ctggccccac 

ttagtttttt 

taacagatat 



agccgccgag 
accggtcgcg 
ttctacaata 
aaaccttcaa 
agccttattt 
ataaatgcaa 



cgggtgacag 
ttcctgaaac 
ctagctttta 
atgaacgaat 
ctggggtaat 
aaactgcatt 



ccctccgaag 
gcagatgtgc 
tggttatgaa 
caaattaaca 
taatcagcga 
aaccacttta 



gaagactctc 
ctcgcgccgc 
gaggaaaaat 
accataggat 
agcgatgatt 
actaatactt 



ctccgtgcgt 60 
actgctccga 120 
tggcagtaac 180 
gataatgcga 240 
tttgatctat 300 
tcaacatttt 360 
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■ 

I 

* 

cggtttgtat tacttcttat tcaaatgtaa taaaagtatc aacaaaaaat tgttaatata 420 
cctctatact ttaacgtcaa ggagaaaaaa ccccggatcg gactactagc agctgtaata 480 
cgactcacta tagggaatat taagcttggt accgagctcg gatccactag taacggccgc 540 
cagtgtgctg gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag 600 
agggccgcat catgtaatta gttatgtcac gcttacattc acgccctccc cccacatccg 660 
ctctaaccga aaaggaagga gttagacaac ctgaagtcta ggtccctatt tattttttta 720 
tagttatgtt agtattaaga acgttattta tatttcaaat ttttcttttt tttctgtaca 780 
gacgcgtgta cgcatgtaac attatactga aaaccttgct tgagaaggtt ttgggacgct 840 
cgaaggcttt aatttgcggc cctgcattaa tgaatcggcc aacgcgcggg gagaggcggt 900 
ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg 960 
ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg 1020 
gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa agcccaggaa ccgtaaaaag 1080 
gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga 1140 
cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct 1200 
ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc 1260 
tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg 1320 
gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc 1380 
tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca 1440 
ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag 1500 
ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct 1560 
ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc 1620 
accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga 1680 
tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca 1740 
cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat 1800 
taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac 1860 
caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt 1920 
gcctgactcc ccgtcgtgta gataactacg atacgggagc gcttaccatc tggccccagt 1980 
gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag 2040 
ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc cattcagtct 2100 
attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt 2160 
gttggcattg ctacaggcat cgtggtgtca ctctcgtcgt ttggtatggc ttcattcagc 2220 
tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt 2280 
agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg 2340 
gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg 2400 
actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct 2460 
tgcccggcgt caatacggga taatagtgta tcacatagca gaactttaaa agtgctcatc 2520 
attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt 2580 
tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt 2640 
tctgggtgag caaaa'acagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg 2700 
aaatgttgaa tactcatact cttccttttt caatgggtaa taactgatat aattaaattg 2760 
aagctctaat ttgtgagttt agtatacatg catttactta taatacagtt ttttagtttt 2820 
gctggccgca tcttctcaaa tatgcttccc agcctgcttt tctgtaacgt tcaccctcta 2880 
ccttagcatc ccttcccttt gcaaatagtc ctcttccaac aataataatg tcagatcctg 2940 
tagagaccac atcatccacg gttctatact gttgacccaa tgcgtctccc ttgtcatcta 3000 
aacccacacc gggtgtcata atcaaccaat cgtaaccttc atctcttcca cccatgtctc 3060 
tttgagcaat aaagccgata acaaaatctt tgtcgctctt cgcaatgtca acagtaccct 3120 
tagtatattc tccagtagat agggagccct tgcatgacaa ttctgctaac atcaaaaggc 3180 
ctctaggttc ctttgttact tcttctgccg cctgcttcaa accgctaaca atacctgggc 3240 
ccaccacacc gtgtgcattc gtaatgtctg cccattctgc tattctgtat acacccgcag 3300 
agtactgcaa tttgactgta ttaccaatgt cagcaaattt tctgtcttcg aagagtaaaa 3360 
aattgtactt ggcggataat gcctttagcg gcttaactgt gccctccatg gaaaaatcag 3420 
tcaagatatc cacatgtgtt tttagtaaac aaattttggg acctaatgct tcaactaact 3480 
ccagtaattc cttggtggta cgaacatcca atgaagcaca caagtttgtt tgcttttcgt 3540 
gcatgatatt aaatagcttg gcagcaacag gactaggatg agtagcagca cgttccttat 3600 
atgtagcttt cgacatgatt tatcttcgtt tcctgcaggt ttttgttctg tgcagttggg 3660 
ttaagaatac tgggcaattt catgtttctt caacactaca tatgcgtata tataccaatc 3720 
taagtctgtg ctccttcctt cgttcttcct tctgttcgga gattaccgaa tcaaaaaaat 3780 
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ttcaaagaaa 
ctagcttatc 
ctagatactc 
cttaccactc 
ctatcttcgc 
acttctacaa 
tcgaatacgc 
tcgtacttgt 
agatagtata 
atgtatttcg 
atccccggtt 
agcatctgtg 
aaagaatctg 
acgaagaatc 
tttcaaacaa 
ttttaccaac 
tatttttcta 
gtctcttgat 
tctattttct 
gaagctgcgg 
ggattgcgca 
aattatgaac 
ttcgtattgt 
gtaatactag 
cgaaaggtgg 
cttttgagca 
tcagaaaagc 
tattttgtta 
cgaaatcggc 
tccagtttcc 
aagggtctat 
gtcgaggtgc 
acggggaaag 
tagggcggtg 
tggggcgcta 



ccgaaatcaa 
gatgataagc 
cgtctactgt 
ttttgttact 
gatgtagtaa 
tggctgccat 
tttgaggaga 
tacccatcat 
tttgaacctg 
gttcctggag 
cattttctgc 
cttcattttg 
agctgcattt 
tgtgcttcat 
agaatctgag 
aaagaatcta 
acaaagcatc 
aactttttgc 
cttccataaa 
gtgcattttt 
tactttgtga 
ggtttcttct 
tttcgattca 
agataaacat 
atgggtaggt 
atgtttgtgg 
cccaaaaaca 
aaattcgcgt 
aaaatccctt 
aacaagagtc 
cagggcgatg 
cgtaaagcag 
ccggcgaacg 
ggaagtgtag 
cagggcgcgt 



aaaaaagaat 
tgtcaaagat 
acgatacact 
ctattgatcc 
aactagctag 
cattattatc 
tacagcctaa 
tgaattttga 
tataataata 
aaactattgc 
gtttccatct 
tagaacaaaa 
ttacagaaca 
ttttgtaaaa 
ctgcattttt 
tacttctttt 
ttagattact 
actgtaggtc 
aaaagcctga 
tcaagataaa 
acagaaagtg 
attttgtctc 
ctctatgaat 
aaaaaatgta 
tatataggga 
aagcggtatt 
ggaagattgt 
taaatttttg 
ataaatcaaa 
cactattaaa 
gcccactacg 
taaatcggaa 
tggcgagaaa 
gggtcacgct 
ggggatgatc 



aaaaaaaaaa 
gagaattaat 
tccgctcagg 
agctcagcaa 
accgagaaag 
cgatgtgacg 
tatccgacaa 
acatccgaac 
tatagtctag 
atctattgca 
tgcacttcaa 
atgcaacgcg 
gaaatgcaac 
caaaaatgca 
acagaacaga 
ttgttctaca 
ttttttctcc 
cgttaaggtt 
ctccacttcc 
ggcatccccg 
atagcgttga 
tatatactac 
agttcttact 
gaggtcgagt 
tatagcacag 
cgcaatggga 
ataagcaaat 
ttaaatcagc 
agaatagacc 
gaacgtggac 
tgaaccatca 
gggtaaacgg 
ggaagggaag 
gggcgtaacc 
cactagt 



tgatgaattg 
tccacggact 
tccttgtcct 
aggcagtgtg 
agactagaaa 
ctgcagcttc 
actgttttac 
ctgggagttt 
cgctttacgg 
taggtaatct 
tagcatatct 
agagcgctaa 
gcgaaagcgc 
acgcgacgag 
aatgcaacgc 
aaaatgcatc 
tttgtgcgct 
agaagaaggc 
cgcgtttact 
attatattct 
tgattcttca 
gtataggaaa 
acaatttttt 
ttagatgcaa 
agatatatag 
agctccaccc 
atttaaattg 
tcatttttta 
gagatagggt 
tccaacgtca 
ccctaatcaa 
atgcccccat 
aaagcgaaag 
accacacccg 



aattgaaaag 3840 
atagactata 3900 
ttaacgaggc 3960 
atctaagatt 4020 
tgcaaaaggc 4080 
tcaatgatat 4140 
agatttacga 4200 
tccctgaaac 4260 
aagacaatgt 4320 
tgcacgtcgc 4380 
ttgttaacga 4440 
tttttcaaac 4500 
tattttacca 4560 
agcgctaatt 4620 
gagagcgcta 4 680 
ccgagagcgc 4740 
ctataatgca 4800 
tactttggtg 4860 
gattactagc 4920 
ataccgatgt 4980 
ttggtcagaa 5040 
tgtttacatt 5100 
tgtctaaaga 5160 
gttcaaggag 5220 
caaagagata 5280 
cggttgataa 5340 
taaacgttaa 5400 
acgaatagcc 54 60 
tgagtgttgt 5520 
aagggcgaaa 5580 
gttttttggg 5640 
ttagagcttg 5700 
gagcgggggc 5760 
ccgcgcttaa 5820 

5857 



<210> 
<211> 
<212> 
<213> 



20 
403 

DNA 

Trichoderma 



reesei 



<220> 
<221> 
<223> 



misc_f eature 
(1-403) T. reesei 



hfb2 coding sequence 



<220> 

<221> intron 
<222> (131) . . (200) 

<220> 

<221> intron 
<222> (287) . . (358) 



<400> 20 

atgcagttct tcgccgtcgc cctcttcgcc accagcgccc tggctgctgt ctgccctacc 60 
ggcctcttct ccaaccctct gtgctgtgcc accaacgtcc tcgacctcat tggcgttgac 120 
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tgcaagaccc 
acttacacga 
ctgtgccagc 
caatggcaaa 
cgaccaggct 



gtatgttgaa 
tgctttacag 
aagggctcca 
gaagtaaaaa 
ctcctgtgcc 



ttccaatctc 
ctaccatcgc 
agcctctttg 
gacatttggg 
agaaggccat 



tgggcatcct 
cgtcgacact 
ctgcgttgct 
cctgggatcg 
cggcaccttc 



gacattggac 
ggcgccatct 
cccgtggtaa 
ctaactcttg 
taa 



gatacagttg 
tccaggctca 
gtagtgctcg 
atatcaaggc 



180 
240 
300 
360 
403 



<210> 21 
<211> 59 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 21 

cggaggagct cgacgacttc gagcagcccg agctgcacgc aggctgtctg ccctaccgg 59 



<210> 22 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3* primer 

<400> 22 

tcattggatc cttagaaggt gccgatggc 29 

<210> 23 
<211> 679 
<212> DNA 

<213> Schizophyllum commune 
<220> ' 

<221> misc__f eature 

<223> (1-679) SC3 coding sequence 

<220> 

<221> misc_feature 
<223> (1-92) 1st cDNA 

<220> 

<221> misc_feature 
<223> (146-183) 2nd cDNA 

<220> 

<221> misc_feature 
<223> (240-317) 3rd cDNA 

<220> 

<221> misc_feature 
<223> (374-469) 4th cDNA 

<220> 



14 



<221> misc_f eature 
<223> (524-586) 5th cDNA 

<220> 

<221> misc_feature 
<223> (635-679) 6th cDNA 



<400> 23 

atgttcgccc 

gctgccctcc 

gtcttgctga 

cggtgagtag 

cgccctcgac 

gcaaccaggt 

ccacatcgta 

cctcagcgac 

ttgttcactg 

cagcggctgt 

catgcgtgtc 

ccccatcaac 



gtctccccgt 
caggtggcca 
tgaagccccg 
ctttctcgcc 
gacgaccatc. 
tcaatcggta 
caggcgagca 
ctcaacgttc 
tcccaattac 
tcggcgcaga 
cctttctccg 
atcctctga 



cgtgttcctc 
cccgggcacg 
tatagcacgc 
gtcgacgact 
gccgccggtg 
cgtacatcaa 
gcagccctgt 
tcgttggcat 
tgcgcactga 
ccgtctgctg 
ctaatcatct 



tacgccttcg 
acgtacgtcg 
cgccggttac 
cgaacgcatt 
gcacgtgtac 
agcggcacga 
taccgccctc 
cagctgctct 
cagactttgc 
cgaaaacacc 
gtagaacggg 



tcgcgttcgg 
acctctcacc 
gacgacggtg 
ggctaatttt 
tacggggtcg 
ccaggcatct 
ctcggcctgc 
cccctcactg 
caggtcatcg 
caattcgtat 
ctgatcaaca 



cgccctcgtc 60 
gtcctctaat 120 
acggtgacca 180 
tgctcatagc 240 
ctctcttgct 300 
cagctgacgg 360 
tcggcattgt 420 
tgagatcttt 480 
gtgtcggagg 540 
gtatactttc 600 
tcggttgcac 660 

679 



<210> 24 

<211> 63 . ■ 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 

<400> 24 

actacacgga ggagctcgac gacttcgagc agcccgagct gcacgcaggg tggccacccg 60 

ggc 63 



<210> 25 
<211> 30 
<212> DNA 

<213> Artificial Sequence 



♦ 



4 



<220> 

<223> Description of Artificial Sequence: PGR 3* primer 
<400> 25 

tcgtacggat cctcagagga tgttgatggg 30 

<210> 26 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 26 

ggaattccgc ggactgcgca tcatgaagtt cttcgccatc gcc 43 
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<210> 27 
<211> 80 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 27 

tgaattccat atgttaggta ccaccggggc ccatgccggt agaagtagaa gccccgggag 60 
caccgacggc ggtctggcac 80 

<210> 28 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 



<400> 28 

tgaattcggt acccaggctt gctcaagcgt c 

<210> 29 
<2il> 34 
<212> DNA 

<213> Artificial Sequence 



31 



<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 29 

tgaattccat atgtcacagg cactgagagt agta 34 

<210> 30 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5' primer 
<400> 30 

gaattcggta ccctcgtccc tcgcggtccc gccgaagtga acctggtg 48 

<210> 31 
<211> 34 
<212> DNA 

<2i3> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: PGR 3* primer 



<400> 31 

tgaattccat atgctaaccc cgtttcatct ccag 



34 



<210> 
<211> 
<212> 
<213> 



32 

918 

DNA 

Trichoderma 



reesei 



<220> 
<221> 
<222> 
<223> 



terminator 
(1) . . (918) 
T. reesei hfbl 



terminator 



<400> 32 

gatgcccgcc 

atttccggta 

atattccatt 

caaaaagact 

ttatttgcaa 

gtggtgtgac 

atacaggatc 

acagtgtaag 

gatatacctc 

agcgagccaa 

cacattgtcg 

cacagattgt 

aaagacctga 

atccttcacc 

cgaaagaacc 

gggggggtaa 



cggggtcaag 

ctgggaaagt 

cgtacgccat 

gctattctct 

gtaggagtgg 

tcgacgagga 

gtctggcaac 

aggaaaagga 

caatatctgt ■ 

gacctcaata 

tcgaaagatg 

ttgccgtccc 

aagcaaaccc 

aagtaaaaga 

aggaaaagaa 

aatgaaat 



gtgtgcccgt 
tggctccacg 
tcttattctc 
ttgtgacatg 
gagattcgcc 
tggactgaga 
ccacggatcc 
aatgtccgcc 
gggaacgaga 
tgcacatcca 
gcatcgtacc 
ccaactccaa 
tttttgcgac 
aaaagaagat 
taaatctatc 



gagaaagccc 
tgtttgggca 
caatatttca 
ccggaaggga 
ttagagaaag 
gtgttaggat 
tatgacttga 
ttcagctgat 
catgacatat 
aagccaaaca 
caaatcatca 
aacgttacta 
tcaattccct 
tgagataata 
acgcaccttg 



acaaagtgtt 
ggtttgggca 
gtacactttt 
acaattgctc 
tagagaagct 
taggtcgaac 
tgcaatggtg 
atccacgcca 
ttgtgggaac 
ttggcaagac 
gctctcatta 
caaaagacat 
cctttgtcct 
catgaaaagc 
tccccacact 



gatgaggacc 60 
agttgtgtag 120 
cttcataaat 180 
ttggtctctg 240 
gtgcttgacc 300 
gttgaagtgt 360 
aagatgaatg 420 
atgatacagc 480 
aacttcaaac 540 
gagagacagt 600 
tcgcctaaac 660 
gggcgaatgc 720 
cggaatgatg 780 
acaacggaaa 840 
aaaagcaaca 900 

918 



<210> 33 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 5* primer 

<400> 33 

gacctcgatg cccgcccggg gtcaag ' 26 



<210> 34 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PGR 3' primer 
<400> 34 

gtcgacattt cattttaccc ccctcg 26 
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<210> 35 
<211> 1190 
<212> DNA 

<213> Trichoderma reesei 
<220> 

<221> promoter 

<222> (1) . . (1190) 

<223> T. reesei hfb2 promoter 



<400> 35 

ctcgagcagc 

acactgccgt 

cgctttgtat 

aggacgatcg 

ggagcaagat 

ccactccctt 

actaacaacc 

gagagaggga 

aatgtcacat 

aatagtcctc 

tttcgaagta 

ttggggagta 

cagcctcttg 

ctcatcatgg 

caagacacca 

gagagattgc 

agccaacgtg 

ggccagaaaa 

ctcatcagca 

tccaaacaca 



tgaagcttgc 
cgacgtctct 
tattagagta 
cggcacgata 
tcagcccaag 
tggccaaatc 
agggatggga 
gagagacaat 
ccgagtcttg 
tttcttgagg 
ttatctggag 
gcccaacgaa 
gagccgtcat 
ccacatacta 
aggccccagc 
gatgagccct 
atgaaaacaa 
aaaagtataa 
cactcatcta 
aacattcttt 



atgcctgcat 
cttccgacct 
tatgcaagtc 
cgccctcgtt 
ggagatggac 
agcttgcatg 
aacgaagcct 
gagtgccaca 
gggcctctgt 
actggatacc 
gatagaagac 
aggtaactcc 
agatcacatc 
ctacatacgt 
atcaagcctc 
ctccctacct 
ggacatgagg 
agaagacctc 
caaccatcac 
gttgaatacc 



cctttgtgag 
tggccagctg 
tcaggactat 
ctcattggcc 
ggcagggcac 
tcaagagaca 
gcttttggaa 
aacctggtag 
gagaatgtcc 
tacgataccc 
gtctaggtaa 
tacggcctct 
tgtgtagacc 
gtctctgcct 
cctacctatc 
acagacggct 
aacagcctgc 
gattcccgcc 
cacattcact 
aaccatcacc 



cgactgcatc 
gacaagcaac 
cgactcaact 
caagcagacc 
gccaggcccc 
tcgagctgtg 
agacaacaat 
tgctccgcca 
agagtaatac 
tttttgagtt 
ctacacaaaa 
tagagccgtc 
gacattctat 
acctgacatg 
ccttccattg 
gacaatgtcc 
gagagctgga 
atccaacaat 
caactcctct 
acctttcaag 



cattttgcac 60 
acaccaatga 120 
ctacccaccg 180 
aactgcccct 24 0 
caccaccaag 300 
ccttgaaatt 360 
gagagagaga 420 
atgcgtctga 4 80. 
gtgttttgcg 540 
gatgcggtgc 600 
ggcctatact 660 
atagatccta 720 
gaataatcat 780 
tagcagtggc 840 
tacagcggca 900 
gtataccacc 960 
agatgaagag 1020 
cttttccatc 1080 
ttctcaactc 1140 

.1190 



<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PGR 5* primer 
<400> 36 

aagcttgcat gcctgcatcc 20 

<210> 37 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PGR 3* primer 
<400> 37 

ccatggtgaa aggtggtgat ggttgg 26 
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1 



<210> 38 
<211> 13 
<212> PRT 

<213> Trichoderma reesei 
<220> 

<221> misc_feature 

<223> vild type T. reesei EGI peptide linker 
<400> 38 

Val Pro Arg Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
1 5 10 . 



<210> 39 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: modified CBHII linker 
<400> 39 

Gly Ser Ser Ser Gly Thr Ala Pro Gly Gly 
1 5 10 



<210> 40 
<211> 19 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Met /Thrombin 
linker 

<400> 40 

Pro Gly Arg Pro Val Leu Thr Gly Pro Gly Met Gly Thr Ser Thr Ser 
1 5 10 15 

Ala Gly Pro 



<210> 41 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Met-containing linker 
<400> 41 
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Pro Gly Ala Ser Thr Ser Thr Gly Met Gly Pro Gly Gly 
15 10 



<210> 42 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: linker containing the thrombin 
cleavage site 

<400> 42 

Gly Thr Leu Val Pro Arg Gly Pro Ala Gly Val Asn Leu Val 
15 10 



<210> 43 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic oligonucleotide 

NhelBglllNhel of the pTNS15 plasmid • 

<400> 43 

gctagagatc tctagc 16 

<210> 44 . ^ ' 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic peptide AocIXbalAocI 

the pTNS15 plasmid 

<4G0> 44 

Ala Ser Gly Ala Ser Arg Ala Ser Gly 
1 5 



<210> 45 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic oligonucleotide 

AocIXbalAocI of the pTNSlS plasmid 

<400> 45 
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gcctcaggag cctctagagc ttcagga 



27 



<210> 46 
<211> 20 
<212> PRT 

<213> Trichoderma reesei 
<400> 46 

Ala Asn Ala Phe Cys Pro Glu Gly Leu Leu Tyr Thr Asn Pro Leu Cys 
15 10 15 



Cys Asp Leu Leu 

20 



21 



